"MedTrinity-25M" is a large-scale multimodal medical dataset released by the UCSC-VLAA team, containing 25 million medical images along with detailed annotations. This dataset features multi-granularity annotations, supports the training of multimodal large medical models, and involves a complex construction process that includes data processing, metadata extraction, region of interest localization, and the collection of medical knowledge. Detailed descriptions are generated using large-scale language models to enhance data usability. The dataset was officially released on July 21, 2024, accompanied by a pre-trained model.